Translation: from English to Russian

expected reward

See also in other dictionaries:

  • Risk/Reward Ratio — A ratio used by many investors to compare the expected returns of an investment to the amount of risk undertaken to capture these returns. This ratio is calculated mathematically by dividing the amount of profit the trader expects to have made… …   Investment dictionary

  • risk-reward ratio — Relationship of substantial reward corresponding to the amount of risk taken; mathematically represented by dividing the expected return by the standard deviation. Bloomberg Financial Dictionary …   Financial and business terms

  • Dopamine — For other uses, see Dopamine (disambiguation). Dopamine …   Wikipedia

  • Partially observable Markov decision process — A Partially Observable Markov Decision Process (POMDP) is a generalization of a Markov Decision Process. A POMDP models an agent decision process in which it is assumed that the system dynamics are determined by an MDP, but the agent cannot… …   Wikipedia

  • Consumer neuroscience — is the combination of consumer research with modern neuroscience. The goal of the field is to find neural explanations for consumer behaviors in both normal and diseased individuals. Contents 1 Consumer Research 2 Advertising 2.1 Advertising and… …   Wikipedia

  • Optimal stopping — In mathematics, the theory of optimal stopping is concerned with the problem of choosing a time to take a particular action, in order to maximise an expected reward or minimise an expected cost. Optimal stopping problems can be found in areas of… …   Wikipedia

  • Dynamic treatment regime — In medical research, a dynamic treatment regime (DTR) or adaptive treatment strategy is a set of rules for choosing effective treatments for individual patients. The treatment choices made for a particular patient are based on that individual's… …   Wikipedia

  • Bellman equation — A Bellman equation (also known as a dynamic programming equation), named after its discoverer, Richard Bellman, is a necessary condition for optimality associated with the mathematical optimization method known as dynamic programming. It writes… …   Wikipedia

  • Survivor: Samoa — Genre Reality television Winner Natalie White (7–2–0) No. of episodes 15 No. of days 39 No. of survivors 20 Trib …   Wikipedia

  • BELBIC — In recent years, the use of biologically inspired methods such as the evolutionary algorithm have been increasingly employed to solve and analyze complex computational problems. BELBIC (Brain Emotional Learning Based Intelligent Controller) is… …   Wikipedia

  • Scoring rule — In decision theory a score function, or scoring rule, is a measure of someone's performance when they are repeatedly making decisions under uncertainty. For example, a TV weather forecaster may give the probability of rain every day. A viewer… …   Wikipedia
